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Abstract 

Given an undirected n-node unweighted graph G = {y,E), a spanner with stretch func- 
tion /(•) is a subgraph H C G such that, if two nodes are at distance d in G, then they are 
at distance at most f{d) in H. Spanners are very well studied in the literature. The typical 
goal is to construct the sparsest possible spanner for a given stretch function. 

In this paper we study pairwise spanners, where we require to approximate the u-v 
distance only for pairs {u,v) in a given set V <^ V x V. Such P-spanners were studied 
before [Coppersmith,Elkin'05] only in the special case that /(•) is the identity function, i.e. 
distances between relevant pairs must be preserved exactly (a.k.a. pairwise preservers). 

Here we present pairwise spanners which are at the same time sparser than the best 
known preservers (on the same P) and of the best known spanners (with the same /(•))• In 
more detail, for arbitrary V, we show that there exists a 'P-spanner of size 0(n(|P|logn)i/4) 
with f{d) = d + 41ogn. Alternatively, for any £ > 0, there exists a P-spanner of size 

0{n\P\^^^^J~^^^^) with f{d) = (1 + e)d + 4. We also consider the relevant special case that 
there is a critical set of nodes 5 C and we wish to approximate either the distances within 
nodes in S or from nodes in S to any other node. We show that there exists an {S x S)- 
spanner of size 0{n^/\S\) with f{d) = d + 2, and an {S x y)-spanner of size 0{n^/\S\\ogn) 
with f{d) = d + 21ogn. All the mentioned pairwise spanners can be constructed in polyno- 
mial time. 

1 Introduction 

Let G = {V, E) be an undirected unweighted graph. A subgraph i7 of G is a spanner with 
stretch function /(•) if, given any two nodes s,t G V a,t distance Scis^t) in G, the distance 
5h{s, t) between the same two nodes in H is at most /{Scis, t)). An (a, /?) spanner is a spanner 
with stretch functions f{d) = a ■ d + p. (a and /3 are the multiplicative stretch and additive 
stretch of the spanner, respectively). If /3 = the spanner is called rnAiltiplicative, and if a = 1 
the spanner is called purely- additive. Spanners are very well studied in the literature (see Section 

'Partially supported by the ERG Starting Grant NEWNET 279352. This work was partially done while the 
third author was visiting IDSIA. 



1.2). The typical goal is to achieve the sparsest possible spanner for a given stretch function 

/(•) ilElIIIlIiailllllSlIIZlIISlEolES]. 

In this paper we address the natural problem of finding (even sparser) spanners in the case 
that we want to approximately preserve distances only among a given subset V C V xV oi pairs. 
More formally a pairwise spanner on pairs V, or "P-spanner for short, with stretch function /(•) 
is a subgraph HOG such that, for any {s,t) E V, Snis^t) < f{SGis,t)). In particular, a 
classical (all-pairs) spanner is a {V x V^)-spanner. Pairwise spanners capture scenarios where 
we only (or mostly) care about some distances in the graph. 

To the best of our knowledge, pairwise spanners were studied before only in the spe- 
cial case that /(•) is the identity function, i.e. distances between relevant pairs have to 
be preserved exactly. Coppersmith and Elkin [8] call such spanners pairwise (distance) pre- 
servers, and show that one can compute pairwise preservers of size (i.e., number of edges) 

0(min I l^l-y/n, ?t-\/|^|)- 

The authors left it as an open problem to study the approximate variants of these preservers, 
i.e. what we call pairwise spanners here. This paper takes the first step in answering this 
question. We show that (for suitable V) it is possible to achieve P-spanners which are at the 
same time sparser than the preservers in |8j (on the same set V) and than the sparsest known 
classical spanners (with the same stretch function). 

1.1 Our Results and Techniques 

In this paper we present some polynomial-time algorithms to construct (a, /?) 7^-spanners for 
unweighted graphs. Our spanners are either purely-additive (i.e. a = 1) or near-additive (i.e. 
a = 1 + e for an arbitrarily small e > 0). For arbitrary V, we achieve the following main results 
(see Section [5]). 

Theorem 1.1. (near-additive pairwise) For any e > and any V V x V , there is a 
polynomial time algorithm to compute a (1 + e, 4) V-spanner of size 

Theorem 1.2. (purely-additive pairwise) For any integer k > 1 and any V V xV , there 
is a polynomial time algorithm to compute a (l,4fc) V-spanner of size 
0(ni+i/(2fc+i)(^(4/t + 5)|P|)^/(2'=+i)). 

We also consider the relevant special case that all the pairs involve at least one node from 
a critical set S Q V. More precisely, we distinguish two types of such pairwise spanners: in 
subsetwise spanners (see Section [s]) we wish to approximate distances between nodes in S, i.e. 
V = S X S; in sourcewise spanners (see Section |4]) we wish to approximate distances from nodes 
in S, i.e. V = S x V . We obtain the following improved results for the mentioned cases. 

Theorem 1.3. (subsetwise) For any S (^V , there is a polynomial time algorithm to compute 
a (1,2) (5 X S)-spanner of size 0{n^J\S\). 



Theorem 1.4. (sourcewise) For any S and any integer k > 1, there is a polynomial 

time algorithm to compute a {1,2k) (S x V)-spanner of size 0{n^^^/'^'^^^^\k\S\)'^/^'^'^^^'>) . 

In particular, by choosing k = logn, we obtain a (l,21ogn) sourcewise spanner of size 
0{n^\S\ logn), and a (l,41ogn) pairwise spanner of size 0{n{\V\ logn)^/^). 

All our spanners rely on a path-buying strategy which was first exploited in the (1,6) 
spanner by Baswana et al. [Ij. The high-level idea is as follows. There is an initial clustering 
phase, where we compute a suitable clustering of the nodes, and an associated subset of edges 
which are added to the spanner. Then there is a path-buying phase, where we consider an 
appropriate sequence of paths, and decide whether to add or not each path in the spanner 
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under constructioij^ In particular, each path has a cost which is given by the number of edges 
of the path not already contained in the spanner, and a value which measures how much the 
path helps to satisfy the considered set of constraints on pairwise distances. If the value is 
sufficiently larger than the cost, we add the considered path to the spanner, otherwise we do 
not. 

In more detail, all our pairwise spanners exploit the same clustering phase. We compute a 
partition C = {Ci, . . . , Cq} of a subset of the nodes, and call unclustered the remaining nodes 
V — UjCj. The initial value of the spanner is Gc = (y,Ec), where Ec contains all the edges 
of G but possibly a subset of the inter- cluster edges (with endpoints in two different clusters). 
The common clustering phase is described in Section [2] 

During the path-buying phase we add to the spanner some extra inter-cluster edges. Here 
we need to finely tune the sequence of paths that we consider, and also the definition of value of 
a path. In our subsetwise and sourcewise spanners the value of a path p refiects the number of 
pairs {v, C), where v is the endpoint of some pair and C is a cluster, such that adding p to the 
current spanner decreases the distance between v and (the closest node in) C. In the remaining 
pairwise spanners, we use a similar notion of value, but considering the distance between pairs 
of clusters {G',G"). 

The sequence of paths used in our subsetwise spanner and near-additive pairwise spanner 
is simply given by the shortest paths among the relevant pairs. This naturally generalizes the 
set of paths considered in However, for the sourcewise spanner and the purely-additive 
pairwise spanner we need to consider a carefully constructed sequence of paths, which includes 
slightly suboptimal paths. In more detail, we start with the set of shortest paths between the 
relevant pairs. Then, for each such path p, if the cost of p is sufficiently smaller than its value, 
we include p in the spanner. Otherwise, we replace p with a slightly longer path p' between the 
same endpoints which is much cheaper, and iterate the process on p' . After a small number of 
iterations, the considered path becomes cheap enough and hence we include it in the spanner. 
This (non-trivial) iterative construction of candidate paths during the path-buying phase is 
probably the main algorithmic contribution of this paper. 



1.2 Related Work 

Graph spanners were introduced by Peleg and Schaffer [T7] in 1989. Spanners have been ex- 
tensively studied since then, and there are numerous applications involving spanners, such as 
algorithms for approximate shortest paths [U \7\ [12], labeling schemes [16^ I14j . approximate 
distance oracles [211 [6l |3] , routing [H [9l |T0] , and network design [l8j . 

There are several algorithms for computing multiplicative and additive spanners in weighted 
and unweighted graphs. In unweighted graphs, for any integer k > 1, Halperin and Zwick |15j 
gave a linear time algorithm to compute a multiplicative {2k — l,0)-spanner of size 0(n^^^/*^), 
where n is the number of vertices. Note that for k = logn one obtains a spanner with multi- 
plicative stretch O(logn) and with size 0{n): we will use this type of spanner in Theorem |l.l[ 
Analogous results are also known for weighted graphs O [201 [H] • 

The first purely-additive spanner (for unweighted graphs) is due to Dor et al. |llj . They 
describe a (1, 2) spanner of size 0(n^/^ logn). This was subsequently improved to 0(n^/^) |13| . 
Note that our subsetwise spanner from Theorem j 1 .3| generalizes this result: in particular, it has 
the same stretch function and is sparser whenever l^l = o(n). Baswana et al. [3] describe a 
(1, 6)-spanner of size 0{n^/^). Whenever \V\ = 0{n'^^^~^) for some constant 6 > 0, we achieve 
an asymptotically sparser pairwise spanner with constant additive stretch (depending on 6). 
The same holds for our sourcewise spanner if 1 5*1 = 0(n^/^~^). 



^In the spanner from Theorem 1.1 there is also a final step where we add a multiplicative (2 logn, 0)-spanner. 
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The result in [15] shows an elegant trade-off between the size of the spanner and its mul- 
tiplicative stretch. No such trade-off is known for purely-additive spanners. In particular, the 
spanner in [Ij is the sparsest known purely-additive spanner. Theorems 1.2 and 1.4 show a 
non-trivial trade-off between the size and additive stretch of pairwise spanners. 

There have also been several results on near-additive spanners \13\ [T2l I22j . For example, 

4/3 

there is a (1 + e,4)-spanner of size 0(^^^) for any e > [13j. Our pairwise spanner from 
Theorem 1.1 has the same stretch function, and is sparser for \V\ = o(n^/^/(e logn)'^). 

Compared to the preservers in [8] , we achieve sparser pairwise spanners with additive stretch 
O(logn) for \V\ = a;(n^/'^ log^^^ n), and a sparser subsetwise spanners for 151 = tj{n^^^). Inter- 
estingly, our sourcewise spanners are always sparser than the pairwise preservers from |Sj. 



2 Clustering 

A clustering of a graph G = (V, E) is a collection C = {Ci, . . . , Cq} of pairwise disjoint subsets 
of nodes Ci C V . Note that we do not require C to span all the nodes V: we call unclustered 
the nodes V — UiCi. 

We will crucially exploit the following construction of a clustering C and of an associated 
cluster subgraph Gq- 

Lemma 2.1. There is a polynomial time algorithm which, given (3 G [0, 1] and a graph G = 
{V, E), computes a clustering C with at most v}~^ clusters and a subgraph Gc of size 0{n^~^^) 
such that: 

1. (missing-edge property) // an edge uv £ E is absent in Gc, then u and v belong to 
two different clusters. 

2. (cluster-diameter property) The distance in Gc between any two vertices of the same 
cluster is at most 2. 

Proof. Let U be the set of nodes which are not yet clustered (initially we set U := V). As long 
as there exists a vertex v £ V with at least \n^~\ neighbors in U, let G contain exactly \n^~\ 
arbitrary neighbors of v in U. Add G to C, set U := U\G and add to Gc all the edges of G with 
both endpoints in C U {v}. When no node v satisfies the mentioned property, we stop creating 
new clusters and add to Gc all the edges incident to the final set of unclustered nodes U. 

By construction, clusters are pairwise disjoint. Each time we create a new cluster, the size 
of U decreases by at least , hence there cannot be more than n^~^ clusters. Any two nodes 
in the same cluster G have some common neighbor v in Gc, hence Property [2] is satisfied. By 
construction, all the edges incident to unclustered nodes plus the intra-cluster edges (with both 
endpoints in the same cluster) belong to Gc, which implies Property [TJ 

It remains to bound the number of edges of Gc- Each time we create a new cluster, the 
number of edges of Gc grows by at most 0{n^^): this gives 0{n^~^n'^^) = 0{n^~^^) edges 
altogether. When we stop creating clusters, each (clustered or unclustered) node v has at most 

neighbors in U: consequently the number of edges incident to unclustered nodes that we 
add at the end of the procedure is at most 0{n^'^^). □ 

The following technical lemma turns out to be useful in the remaining sections. 



Lemma 2.2. Let C and Gc be constructed with the procedure from Lemma 2.1 w.r.t. a given 
graph G = {V,E). If the shortest path p in G between any two nodes u,v gV contains t edges 
that are absent in Gc, then there are at least t/2 clusters of C having at least one vertex on p. 
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Proof. We prove the lemma by counting pairs {u, e), where e is an edge of p absent in Gq and 
u is one of the endpoints of e: let 5 be the set of such pairs. Since p contains t edges that are 
absent in Gc there are exactly 2t pairs in S (each edge e = uv belongs to two pairs: {u, e) and 
{v, e)). We say that a cluster G £ C owns a pair (u, e) if u G G. By the missing-edge property, 
each edge e of p absent in Gc has both endpoints clustered, hence each pair of S is owned by 
some cluster. 

Let us assume that there are x clusters of C having at least one vertex on p. By the cluster- 
diameter property any cluster C G C contains at most 3 vertices on p, since otherwise p would 
not be a shortest path between u and v. However, if a cluster C € C contains exactly 3 vertices 
on p, those have to be consecutive vertices a, b, c of p, since p is a shortest path and we know 
by the cluster-diameter property that there is a path of length at most 2 between every pair in 
{a, b, c}. By the missing-edge property both edges ab and be are present in Gc, and consequently 
G owns at most two pairs of S. Clearly if a cluster C G C contains at most 2 vertices on p, then 
it owns at most 4 pairs of S. Therefore each cluster owns at most 4 pairs of S: since S has 2t 
pairs we have x > t/2. □ 



3 Subsetwise Spanners 



In this section we present our algorithm to compute a subsetwise spanner, and prove Theo- 
rem [L3l 

Our algorithm consists of two main phases: a clustering phase and a path-buying phase. 



In the clustering phase we invoke Lemma 2.1 and obtain a cluster subgraph Gc of G of size 



0{n^^^), together with a set C of at most n^~^ clusters. The value of /3 will be defined later. 

In the path-buying phase we proceed as follows. Initially set Go := Gc and let {pi, . . . , pz}, 
denote the set of z = ('2') shortest paths between all pairs of vertices in S. We let {ui,Vi) 
denote the endpoints of pi. Next, we iterate over the paths pi for i = 1, . . . , z. To determine 
which paths are affordable, we define the functions value(-) and cost(-): 

• let cost(/3j) be the number of edges of pi that are absent in Gi-i 

• let value(/7j) be the number of pairs {x,G), where x £ {ui,Vi} and C G C is a cluster, 
such that Pi contains at least one vertex of G and the distance between x and G in the 
graph Gi-i is strictly greater than the distance between u and G in pi, i.e., 5Gi_i{x, G) > 
Sp^{x,G). 

Our path-buying strategy is as follows. If 

cost(pj) < 2value(pj) 

then we buy the path pi, that is we set Gi := Gi-i U pi (in words, Gi is given by Gi-i plus 
the edges of pi not in Gi-i). Otherwise (i.e., 2value(pj) < cost(pj)), we do not buy pi and set 
Gi := Gi-i. The subsetwise spanner is given hy Gg '■= Gz- 

The next two lemmas bound the stretch and the size of the constructed spanner Gg, respec- 
tively 

Lemma 3.1. For any {ui,Vi) e V, SaX'^i-.Vi) < Sciui^Vi) + 2. 

Proof. Clearly the claim holds if our algorithm bought the path p,, hence we assume 2value(/jj) < 
cost(pi). Let cost(/Oj) = t, that is there are exactly t edges of pi which are not present in the graph 
Gi-i. By Lemma |2.2| there are at least t/2 clusters having at least one vertex on pi. If there is 
no cluster G among them such that (5gj_i {ui, G) = Sciui, G) and ^Ci-i {vi,G) = Scivi, G), then 
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all these clusters would contribute to value(/9j) (either with Ui or with Vi or both) which leads 
to a contradiction, because t = 2 ■ {t/2) < 2value(pi) < cost(pi) = t. 

Thus there is a cluster C having a vertex of pi such that 6Gi_i{ui,C) = 6G{ui,C) and 
SGi-i{vi,C) = 5G{vi,C). This implies: 

^Gsi'^i^vi) < 6G,_i{ui,Vi) < 5G,_i(?^i,C') +(5G,_i(fj,C) + 2 

< 6G{u^,C) + 6G{vi,C) + 2 

< 5G{ui,Vi) + 2, 

where the first inequality is because Gi-i is a subgraph of Gs, the second inequality holds 
since any two vertices of C are at distance at most two in Gq C Gi-i (by the cluster-diameter 
property) and the last inequality follows from the assumption that G contains a vertex oi pi. □ 

Lemma 3.2. For /3 such that = \/\S\ the graph Gg contains 0{ny^\S\) edges. 

Proof. The clustering phase produces a graph with 0{n^~^^) = 0{n^/\S\) edges. Let B be the 
set of paths bought in the path- buying phase. The total number of edges that appear in B and 
do not appear in Gq is equal to X^p.gg cost(/)j), which is upper bounded by Ylp^;^Q2value{pi). 
Observe that after the first contribution of a pair (x, G) to the above sum, the distance between 
X and G is at most Scix, G) + 2, hence each pair (x, C) can contribute to the sum at most 3 
times. Therefore the total number of edges added in the second phase of our algorithm is upper 
bounded by 0{n^-^\S\) = 0{n^\). □ 



The proof of Theorem |1.3| follows from Lemmas 3.1 and 3.2 



4 Sourcewise spanners 

In this section we present our algorithm to compute a sourcewise spanner from sources 5, and 



prove Theorem 1.4 



Our algorithm again consists of two phases, where the first is a clustering phase and the 
second is a path-buying phase. The clustering phase is as in the algorithm from previous section, 
for a proper value of /3 to be defined later. Let C and Gq be the resulting clustering and cluster 
subgraph. 

At the start of the second phase we set Go := Gc and define as the set of 

shortest paths between any two vertices of V such that at least one of them belongs to S. Let 
us assume that the path pi is a shortest path between Ui G S and Vi £ V. Next, we iterate 
over paths pi for i = 1, . . . ,z. For a given i we are going to define paths p^-, where < j < k, 
maintaining the following invariants: 

(i) pI is a path between m and Vi in G of length at most SG{ui,Vi) + 2j, 

(ii) any cluster G £ C contains at most three vertices of , 

(iii) cost(p^) < 2n^^^/'j^, where cost(/3^) is the number of edges of pj absent in and 

Our algorithm will buy exactly one path p-'- for < j < k, which will ensure (by Invariant (i)) 
that in Gi, the distance between m and Vi is at most 5G{ui,Vi) + 2k. 

We set p-* := pi. Observe that for j = 0, Invariant (i) is trivially satisfied. Invariant (ii) 
is satisfied by the cluster-diameter property (otherwise pi would not be a shortest path), and 
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R 




Figure 1: The solid edges represent a path /o], while the dashed edges denote the new prefix of 
the path p'j^^ ■ 



Invariant (iii) is satisfied because there are at most n^~^ clusters in C and consequently by 
Lemma 2.2 cost(pj) < 2n^~^ . 

Say we have constructed p^, where j € {0, . . . , A;}. Let us define the function value(/3^) as the 
number of clusters C G C such that C contains a vertex of p'^ and the distance between Ui and C 
in is strictly greater than the distance between Mj and Cin/o^, i.e., C) > ^ j{ui-,C). 

Now we check the condition 

cost(/9^) < 37value(/3^). 

If that is the case, then we buy the path p^. That is, Gi is set to Gi-i U /3^. We ignore the 
remaining values of j and proceed with the next value of i. Else we construct p'j^^ as follows: 

Let R be the longest suffix of p'^ containing exactly [cost(/9^)/7j edges that are absent 
in Gi-i. Observe that the first node of R is clustered: by the maximality of i?, the edge 
e of preceding R is absent in Gj_i, and hence both the endpoints of e (one of which is 
the first node of R) are clustered by the missing-edge property of Gc- Consequently at least 
1 + Lcost(/9^)/7j > cost(/?^)/7 vertices of R are clustered, as R contains Lcost(p^)/7j edges 
absent in Gj-i and the endpoints of these edges are clustered. 

By Invariant (ii) there are at least cost (pj)/ (87) clusters in C having at least one vertex of 
R. Since we did not buy p'^ , there exists a cluster G & C containing a vertex x G C oi R such 
that the distance between Ui and G in Gi-i is at most the distance between Ui and x in We 
construct the path p^"*"^ by taking a shortest path in Gi-i from Ui to the closest node y G C, 
then we add a path of length at most two between y and x (which exists in Gq hence in 
by the cluster-diameter property), and finally add the suffix of R starting at x (see Fig. [T]). 

Let us show that p|^^ maintains the invariants. Note that by construction. Invariant (i) is 
satisfied, since the length of p^^^ is at most the length of p^ plus 2. Then, as long as there is a 
cluster G £ C containing at least four vertices on pj^ , we let a, b be the vertices of p^^ closest 
to Ui and Vi respectively. Note that there are at least three edges on p^"*"^ between a and b, 
hence we can replace the subpath of p|^^ by adding the at most two edges of Gc guaranteed by 
the cluster-diameter property. Consequently, Invariant (ii) is satisfied. Moreover, by the choice 
of R, Invariant (iii) is also satisfied. This finishes the construction of p^"*"^. 

Observe that by Invariant (iii) we have cost (pf) < 2/3: since cost(-) has only integral values, 
it has to be that cost(p^) = 0, which ensures that we buy a path pj for some j < k. 

Finally, as our spanner Gs we take Gg := Gz- 

Lemma 4.1. For any pair {ui, Vi) S V , Sc^iui, Vi) < Sciui, vi) + 2k. 

Proof. From the above discussion, we buy at least one path pj for some < j < k. By Invariant 
(i), the length of the latter path is at most the length of the shortest path pi between Ui and Vi 
plus 2k. □ 
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Lemma 4.2. For (3 such that = (n ^/''{2k + 3)|5|)'=/(2fc+i)^ subgraph Gs contains 

Q(„l+l/(2fe+l)(^|5|)fc/(2fc+l)) g^^g^^ 

Proof. To bound the size of Gg we recall that in the first phase we have inserted 0{n^^^) edges. 
Let < ji < k he the index of a path pi' bought for a given i. We claim, that any cluster 
C contributes to value(/9^') of at most 151(2/0 + 3) bought paths. This holds because when for 
Ui £ S a supported path is bought the distance between Uj and C is at most 2k + 2 greater 
than the distance between Ui and C in G: otherwise one could shorten pf by more than 2k, 
obtaining a contradiction with Invariant (i). Therefore the total number of edges added during 
the second phase is upper bounded by J2i=i cost(p^') < Yli=i 37value(/9]') < 37(2A; + 3)|5'|n-'^^^, 
since each cluster C G C supports at most |5|(2A; + 3) bought paths. The claim follows. □ 



The proof of Theorem 1.4 follows from Lemmas |4.1| and |4.2 



5 Pairwise spanners 

In this section we present our pairwise spanners for arbitrary V. We start with a near- additive 



spanner (see Section 5.1) and then present a purely-additive spanner (see Section 5.2). In both 



cases we let V = {(si,ti), . . . , {s]\[,tN)} denote the set of pairs, N = \V\ 
5.1 A Near- Additive Pairwise Spanner 



Our algorithm to construct the near-additive P-spanner from Theorem 1.1 consists of three 
phases. First, we use Lemma [2 . 1 1 with the value of f3 to be determined later, obtaining a cluster 
subgraph Gq of G of size 0(n^^^) together with a set C of at most n^~^ clusters. 

At the start of the second phase we set Go ■= Gc and consider the set of paths {pi, . . . , p^} , 
where pi is a shortest path between Si and ti in G. Next we iterate over the paths pi for 
i = 1, . . . , N. By cost(/9j) we denote the number of edges of pi absent in Gj-i, and by value(/9j) 
we denote the number of pairs of clusters (Ci,C2) G C, such that both Ci and C2 contain at 
least one vertex of pi and (5p.(Ci, C2) < 5Gi_i(C'i) ^^2)- For a given i if 



cost(pi) < Rvalue (pi), 

then we buy pi, that is we set := Gi-i U pi. Otherwise we set Gi := Gi-i. 

In the third phase we add to Gat the multiplicative (2 log n, 0) spanner of size 0{n) given in 
|15j : this way we obtain the desired spanner Gs- 

In the following two lemmas we bound the stretch and size of Gs, respectively. 

Lemma 5.1. For each {si,ti) £ V , Scsisi^U) < (1 +e)5G{si,ti) + 4. 

Proof. Clearly we can assume that the path pi was not bought in the second phase, since 
otherwise the claim trivially holds. Therefore cost (pi) > y^value(/9j). 

Let Pi = {vq = Si,vi,. . . ,ve-i,ve = ti) and let I C {0, . . . ,£} be the set of all indices j 
such that Vj is clustered. Observe that if |/| < 1, then by the missing-edge property the whole 
path Pi is present in Gc, and hence the claim holds. Therefore denote / = {zq, . . • ,iw}, where 
io < ii < . . . < iw and w > 1. Let 0<a<b<whe two indices, such that Vi^ G Gi, Vi^ G G2 
(for some Gi,G2 G C), 6p.{Gi,C2) > (^C-i (C*!) C'2) and the value of 6 — a is maximized. Note 
that such a pair of indices a, b always exists, since we can take a = b. 

Let X = a + {w — b). Observe that any cluster G G C contains at most 3 vertices of 
Vj = {vi. ■ < j < w}, since otherwise by the cluster-diameter property pi would not be a 
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< Sp^{s^,ViJ + 2a\ogn < Sp^{vi^,ViJ + 4 < Sp,{vi,,ti) + 2{w - b)\ogn 



-mti 



Figure 2: Illustration of the three paths concatenation in the proof of Lemma 5.1 



shortest Si-ti path. Therefore there are at least x/6 clusters C having at least one vertex in 
the set {vi- : < j < [x/2]}, and at least x/6 clusters C" having at least one vertex in the set 
{vi. : w — \x/2] < j < w}. However, each of the at least (x/6)^ pairs of clusters in C x C" 
contributes to value(pj) since the difference between indices in the corresponding set is at least 
w - \x/2] + 1 - ([x/2] -l)>w-x. Therefore w > cost(pi) > l^M^x ^nd hence x < 

The latter bound on x is sufficient to prove the claim. In fact, consider the path between Si 
and ti in Gs obtained by concatenating the following paths (as illustrated in Fig. [2]): 

• A shortest path in Gs from Si to Vi^. Note that in the prefix of pi between Sj and Vi^ 
there are a + 1 clustered nodes and hence at most a edges absent in Gi-i (by the missing- 
edge property). Since Gs contains the (2 log n, 0)-spanner added in the third phase, each 
missing edge can be replaced by at path of length 2 log n. Consequently, there is a path 
from Si to Vi^ of length at most 6pi{si,Vi^) + 2a log n in Gs- 

• A shortest path in Gs from Vi^ to Vi^. Let Ci,C2 G C be the clusters containing Vi^ and 
Vi^ respectively. We know that in Gj-i there is a path from Ci to C2 of length at most 
Spi{vi^,Vif^), which can be extended to a path between Vi^ and Vi^ in Gi-i by adding at 
most 4 edges (by the cluster-diameter property). 

• A shortest path in Gs from to ti. Observe that in the suffix of pi between Vi^ and ti 
there are at most w — b edges absent in Gj_i by the same argument as above. Hence, 
thanks to the (2 log n, 0)-spanner added in the third phase, there is a path from Vi^^ to ti 
of length at most 6p^{vi^, ti) + {w — 6)2 log n in Gs- 

The resulting path is of length at most 

^Pr{si,ViJ + 2alogn + 6p^{vi^,Vi,,) + 4 + 6p^{vi^,ti) + {w - 6)2 log n 
= 6G{si,ti) + 2a; logn + 4 < (1 + e)6G{si,t,) + 4, 

where the last inequality follows from x < 2\^n fog^ther with w < dcisijti)- □ 

Lemma 5.2. For (3 such that n'^^ = ^/N^^ the size ofGs is 0{nN^/'^^logn/e)- 

Proof. The clustering phase gives 0{n^~^^) edges, which matches the desired bound on Gs- Let 
B be the set of paths pi bought in the path-buying phase. Observe, that if a pair of clusters 
Ci,C2 contributes to value(pj) of a bought path pi G B, then when pi is bought we have 
SGiiCi,C2) < dG{Ci,G2) + 4, since otherwise the subpath of pi between Gi and G2 might be 
shortened (by the cluster-diameter property). It follows that each pair of clusters contributes 
at most 5 times to value(pi), and hence 

value(pi) < 5(ni-^)2. (1) 
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The total number of edges added in the second phase is therefore upper bounded by 



^ cost ( Pi) < v^value(jOi 

Cauchy-Schwarz 

inequality 12 log n 



< — /2_] value(pi)V]V 



5 l^J5i!!V5n'-^^/^v 

e 

Finally, in the last phase we insert only 0{n) edges when adding the (2 log n, 0)-spanner. □ 
Having Lemmas 5.1 and 5.2, the proof of Theorem 1 1 . 1 1 follows . 



5.2 A Purely- Additive Pairwise Spanner 

In this section we describe an algorithm to compute the purely-additive 7^-spanner from Theo- 



rem 



1.2 To that aim we will combine ideas from the proofs of Theorems 1.1 and 1.4 
Our algorithm consists of the usual clustering phase (for an appropriate parameter /?) fol- 
lowed by a path-buying phase that we next describe. 

Let C and Gc be the clustering and the associated cluster graph. At the beginning of the 
path-buying phase, we set Go := Gc and consider the set {pi, . . . ,pn}, where pi is a shortest 
path between Si and ti in G. Next we iterate over the paths pi for i = 1, . . . ,N. For a given i 
we are going to define paths p^, where < j < k, maintaining the following invariants: 

(i) pI is a path between Si and ti in G of length at most 5cisi, U) + 4j, 

(ii) any cluster C £ C contains at most three vertices of , 

(iii) cost(p^) < 271^^^/^^, where cost(p^) is the number of edges of p^ absent in and 

Our algorithm will buy exactly one path p^, which will ensure by Invariant (i) that in Gi the 
distance between Si and U is at most 6c{ui,Vi) + Ak. By value(p^) let us denote the number 
of pairs of clusters Ci, C2 E C, such that both Ci and C2 contain at least one vertex of p^ and 

5 ,(Ci,C2) <fc,.i(Ci,C2). 

We set p^ := pj. Observe that for j = Invariant (i) is trivially satisfied, Invariant (ii) 
is satisfied by the cluster-diameter property (otherwise pi would not be a shortest path), and 
Invariant (iii) is satisfied because there are at most n^~^ clusters in C and consequently by 
cost(pi) < 2n^-(^. 



Lemma 



2.2 



Say we have constructed p^, where j G {0, . . . ,k}. If 



cost(p]) < 67Yvalue(p^' 
then we buy the path p^, i.e. as Gi we take the union of Gi_i and pj, ignore remaining values of 



j and proceed with the next value of i. Otherwise (i.e., cost(p^) > 67-y/value(p])), we construct 
a path p|^^ as follows: 
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Let = (vq = Si,vi, . . . ,vi_i,vi = ti) and let / C {0,...,^} be the set of all indices 
j such that Vj is clustered. Observe that if |/| < 1, then by the missing-edge property the 
whole path pf is present in Gc, and hence it is of zero cost, which contradicts the assumption 

cost{pi) > 67 value Therefore denote / = {iq, . . . where io < ^1 < • • • < ^u; and 
w > 1. Let 0<a<b<whe two indices, such that Vi^ S Ci, Vi^ S C2 (for some Ci, C2 G C), 
(Ci, C2) > (Ci, C2) and the value of 6 — a is maximized. Note that such a pair of indices 
a, b always exists, since we can take a = b. 

Let X = a + {w — b). By Invariant (ii) there are at least x /6 clusters C having at least one 
vertex in the set {vi- : < j < \x/2\}, and at least x/6 clusters C" having at least one vertex 
in the set {vi- : w — \x/2] < j < w}. However, each of the at least (x/6)^ pairs of clusters in 
C X C" contributes to value(/9^) since the difference between indices in the corresponding set is 
at least w — \x/2\ + 1 — ([x/2] — 1) > w — x. Therefore 



cost(p^) > 67Yvalue(/3^) > 67^ (^-j = j x =^ x < cost{pi) / j . (2) 

We construct the path by appending the following three paths A, B, and C: 

• As A we take the prefix of pi from Sj to Vi^ . Note that this prefix contains a + 1 clustered 
nodes and hence at most a edges absent in (by the missing-edge property of Gq)- 

• Let Ci,C2 S C be the clusters containing Vi^ and Vi^ respectively. We know that in Gi-i 
there is a path from Ci to C2 of length at most 6 3 {vi^ ,Vi^), which can be extended to a 
path B between Vi^ and Vi^^ in Gi-i by adding at most 4 edges (by the cluster-diameter 
property) . 

• As C we take the suffix of pi from Vi^ to ti , which contains at most w — b edges absent in 
Gi-i by the same argument as above. 

Observe that pi~^^ contains at most a + {w — b) = x edges absent in hence by ^ we 

ensure Invariant (iii). Moreover the length of pi~^^ is at most the length of pi plus 4, which 
ensures Invariant (i). In order to ensure Invariant (ii), as long as there exists a cluster G £ C 
containing at least 4 vertices of p^"*"^ we let u and v be two such vertices closest to Si and ti on 
pi~^^ respectively and replace the subpath of p^"*"^ between u and v (which is of length at least 
three) by a path of length at most two in Gi-i (which exists by the cluster-diameter property). 

Observe that by Invariant (iii) we have cost(p^) < 2/3, hence cost(p^) = which ensures 
that we buy a path pP- for some j < k. Finally, as our spanner Gs we take Gs ■= Gn- 

Lemma 5.3. For each (sj,tj) G V, 6cs{si,ti) < Scisi^ti) + 4,k. 

Proof. From the above discussion, Gg contains at least one path pP^ between Si and ti for some 
< j < k. The claim follows by Invariant (i). □ 

Lemma 5.4. F or (3 such th at = (6ni/'^V(4fc + 5)Af)''/(2fc+i) the size ofGs is 
0(^i+i/(2fc+i) (y(4^^5PV)fe/(2fc+i)) ^ 

Proof. The clustering phase gives 0[n^^f^) edges, which matches the desired bound on Gs- 
Let < jj < A; be the index of a path /t^' bought for a given i. We claim, that any pair of 
clusters contributes to value(p^') of at most {4k + 5) bought paths. Observe, that if a pair 
of clusters Ci,C2 contributes to value{pl^), then when p^* is bought we have 5GiiGi,C2) < 
5g{Ci, C2) + 4/c -|- 4, since otherwise the subpath of p^' between Ci and C2 might be shortened 
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by more than 4A:, contradicting Invariant (i). The total number of edges added in the second 
phase is upper bounded by 



cost(/9f) < 67yvalue(/^' 



l<2<iV l<i<Af 

Cauchy-Schwarz 

inequality I ^ '. 

Y l<i<Ar 

By substituting 7 and /3 the claim follows. □ 
Theorem |L2J follows from Lemmas [53] and [53 



6 Conclusions 

We considered a natural extension to the problem of computing a sparse spanner in an undi- 
rected unweighted graph. Along with the input graph G = (y,E), a subset V V x V oi 
relevant pairs of vertices is also given here and we seek a sparse subgraph H ol G such that 
for every pair (u, v) in "P, the u-v distance 6h{u,v) in the subgraph is close to the u-v distance 
6g{u, v) in G. We showed sparse subgraphs H where 5h{u, v) is a small additive or near-additive 
stretch away from 5g{u,v). 

The pairwise preservers in [8] are at the same time more accurate and sparser than our 
spanners for small enough values of {Vl- In particular, in that range of values of \V\ the authors 
exploit a construction which does not seem to benefit from allowing a larger stretch. The authors 
also present lower bounds on the size of any preserver, however it is unclear whether those 
lower bounds extend to the case of pairwise spanners (where distances have to be approximated 
rather than preserved). Obtaining sparser pairwise spanners for very small \V\, if possible, is 
an interesting open problem. 
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